Uniform Glivenko-Cantelli Theorems and Concentration of Measure in the Mathematical Modelling of Learning
نویسنده
چکیده
This paper surveys certain developments in the use of probabilistic techniques for the modelling of generalization in machine learning. Building on ‘uniform convergence’ results in probability theory, a number of approaches to the problem of quantifying generalization have been developed in recent years. Initially these models addressed binary classification, and as such were applicable, for example, to binary-output neural networks. More recently, analysis has been extended to apply to regression problems, and to classification problems in which the classification is achieved by using real-valued functions (in which the concept of a large margin has proven useful). In order to obtain more useful and realistic bounds, and to analyse model selection, another development has been the derivation of datadependent bounds. Here, we discuss some of the main probabilistic techniques and key results, particularly the use (and derivation of) uniform Glivenko-Cantelli theorems, and the use of concentration of measure results. Many details are omitted, the aim being to give a high-level overview of the types of approaches taken and methods used.
منابع مشابه
Preservation Theorems for Glivenko-cantelli and Uniform Glivenko-cantelli Classes Aad Van Der Vaart and Jon
We show that the P Glivenko property of classes of functions F1; : : : ;Fk is preserved by a continuous function ' from R k to R in the sense that the new class of functions x! '(f1(x); : : : ; fk(x)); fi 2 Fi; i = 1; : : : ; k is again a Glivenko-Cantelli class of functions if it has an integrable envelope. We also prove an analogous result for preservation of the uniform Glivenko-Cantelli pro...
متن کاملUniform Glivenko–Cantelli Classes
A class of sets, or functions, is said to be P–Glivenko–Cantelli if the empirical measure Pn converges in some sense to the true measure, P , as n → ∞, uniformly over the class of sets or functions. Thus, the notions of Glivenko–Cantelli, and likewise uniform Glivenko–Cantelli are for the most part qualitative assessments of how “well–behaved” a collection of sets or functions is, in the sense ...
متن کاملPreservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes
We show that the P−Glivenko property of classes of functions F1, . . . ,Fk is preserved by a continuous function φ from R to R in the sense that the new class of functions x → φ(f1(x), . . . , fk(x)), fi ∈ Fi, i = 1, . . . , k is again a Glivenko-Cantelli class of functions if it has an integrable envelope. We also prove an analogous result for preservation of the uniform Glivenko-Cantelli prop...
متن کاملA Counterexample Concerning the Extension of Uniform Strong Laws to Ergodic Processes
We present a construction showing that a class of sets C that is Glivenko-Cantelli for an i.i.d. process need not be Glivenko-Cantelli for every stationary ergodic process with the same one dimensional marginal distribution. This result provides a counterpoint to recent work extending uniform strong laws to ergodic processes, and a recent characterization of universal Glivenko Cantelli classes.
متن کاملSome Converse Limit Theorems for Exchangeable Bootstraps
The bootstrap Glivenko-Cantelli and bootstrap Donsker theorems of Giné and Zinn (1990) contain both necessary and sufficient conditions for the asymptotic validity of Efron’s nonparametric bootstrap. In the more general case of exchangeably weighted bootstraps, Praestgaard and Wellner (1993) and Van der Vaart and Wellner (1996) give analogues of the sufficiency half of the Theorems of Giné and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002